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Abstract. We study decompositions of NVALUE, a global constraint that can 
be used to model a wide range of problems where values need to be counted. 
Whilst decomposition typically hinders propagation, we identify one decomposi- 
tion that maintains a global view as enforcing bound consistency on the decom- 
position achieves bound consistency on the original global NVALUE constraint. 
Such decompositions offer the prospect for advanced solving techniques like no- 
good learning and impact based branching heuristics. They may also help SAT 
and IP solvers take advantage of the propagation of global constraints. 



1 Introduction 

Global constraints are an important feature of constraint programming. They capture 
common patterns in real world problems, and provide efficient propagators for pruning 
the search space. Consider, for example, the NVALUE constraint which counts the num- 
ber of values used by a set of variables [1]. This global constraint can model problems 
where values represent resources. This is a common constraint that can be used to model 
many practical problems such as timetabling and frequency allocation. Whilst enforc- 
ing domain consistency on the NVALUE constraint is NP-hard [2], bound consistency is 
polynomial to achieve. At least four different propagation algorithms for the NVALUE 
constraint have been proposed, some of which achieve bound consistency [3-5]. 

We have recently proposed simulating propagators for global constraints with de- 
compositions. For instance, we have shown that carefully designed decompositions of 
the global All-Different and GCC constraints can efficiently simulated the corre- 
sponding bound consistency propagators [6]. We turn now to the NVALUE constraint. 
We study a number of different decompositions, one of which permits the achievement 
of bound consistency on the NVALUE constraint. Such decompositions open out a num- 
ber of promising directions. For example, they suggest schema for learning nogoods. As 
a second example, such decompositions may help construct nogood and impact based 
branching heuristics. As a third and final example, such decompositions may permit 
SAT and IP solvers to take advantage of the inferences performed by the propagators 
of global constraints. We have, for instance, seen this with our decompositions of the 
All-Different constraint [6]. 



2 Background 



We assume values are taken from the set 1 to d. We write dom(Xi) for the domain 
of possible values for JQ, min(Xi) for the smallest value in dom(Xi), max(Xi) for 
the greatest, and range(Xi) for the interval [min(Xi),max(Xi)]. A global constraint 
is one in which the number of variables n is a parameter. For instance, the global 
N Value ( [X x , ... ,X n ], N) constraint ensures that N = \{X { \ 1 < i < n}\ [1]. 
Constraint solvers typically use backtracking search to explore the space of partial as- 
signments. After each assignment, propagation algorithms prune the search space by 
enforcing local consistency properties like domain, range or bound consistency. A con- 
straint is domain consistent (DC) iff when a variable is assigned any of the values in 
its domain, there exist compatible values in the domains of all the other variables of 
the constraint. Such an assignment is called a support. A constraint is disentailed iff 
there is no possible support. A constraint is range consistent (RC) iff, when a variable 
is assigned any of the values in its domain, there exist compatible values between the 
minimum and maximum domain value for all the other variables of the constraint. Such 
an assignment is called a bound support. A constraint is bound consistent (BC) iff the 
minimum and maximum value of every variable of the constraint belong to a bound 
support. We will compare local consistency properties applied to sets of constraints, c\ 
and c 2 which are logically equivalent. As in [7], a local consistency property <P on c\ is 
as strong as <P on c 2 iff, given any domains, if <P holds on c\ then W holds on c 2 ; <P on 
ci is stronger than i' on c 2 iff <P on c x is as strong as ^ on c 2 but not vice versa; <P on 
ci is equivalent to ^ on c 2 iff <P on c x is as strong as 9 on c 2 and vice versa. Finally, 
as constraint solvers usually enforce local consistency after each assignment down a 
branch in the search tree, we will compute the total amortised cost of enforcing a local 
consistency down an entire branch of the search tree. This captures the incremental cost 
of propagation. 



3 NValue constraint 



Pachet and Roy proposed the N Value constraint (called by them the "cardinality on at- 
tribute values" constraint) to model a combinatorial problem in selecting musical play- 
lists [1]. It can also be used to model the number of frequencies used in a frequency 
allocation problem or the number of rooms needed to timetable a set of exams. It gen- 
eralizes several other global constraints including All-Different (which ensures a 
set of variables take all different values) and Not-All-Equal (which ensures a set of 
variables do not all take the same value). Enforcing domain consistency on the NValue 
constraint is NP-hard (Theorem 3 in [2]) even when N is fixed (Theorem 2 in [4]). In 
fact, computing the lower bound on TV is NP-hard (Theorem 3 in [8]). In addition, en- 
forcing domain consistency on the NVALUE constraint is not fixed parameter tractable 
since it is in the W [2] -complete complexity class along with problems like minimum 
hitting set (Theorem 2 in [9]). However, a number of polynomial propagation algo- 
rithms have been proposed that achieve bound consistency and some closely related 
levels of local consistency [3-5]. 



3.1 Simple decomposition 



We can decompose the NValue constraint by introducing 0/1 variables and posting 
the following constraints: 

Xi=j^Bj = l Vl<i<n, l<j<d (1) 
Bj = 1 - V?=i *i =3 VI < j < d (2) 
E7=i Bj = N (3) 

Unfortunately, this simple decomposition hinders propagation. It can be BC whereas 
BC on the corresponding NValue constraint detects disentailment. 

Theorem 1 BC on NValue is stronger than BC on its decomposition into (1) to (3). 

Proof: Clearly BC on NVALUE is at least as strong as BC on the decomposition. To 
show strictness, consider X 1 G {1, 2}, X 2 G {3, 4}, Bj G {0, 1} for 1 < j < 4, and 
N = 1. Constraints (1) to (3) are BC. However, the corresponding NVALUE constraint 
has no bound support and thus enforcing BC on it detects disentailment. □ 

We observe that enforcing DC instead of BC on constraints (1) to (3) in the example 
of the proof above still does not prune any value. To decompose NVALUE without 
hindering propagation, we must look to more complex decompositions. 



3.2 Decomposition into AtMostNValue and AtLeastNValue 

Our first step in decomposing the NVALUE constraint is to split it into 
two parts: an AtMostNValue and an AtLeastNValue constraint. 
ATLEASTNVALUE(pfi,...,X„],AO holds iff N < \{Xi\l < i < n}\ whilst 
AtMostNValue([Xi, ...,X n ],N) holds iff \{X t \l < i < n}\ < N. 

Running Example. Consider a NVALUE constraint over the following variables and 
values: 
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3 4 5 
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Suppose we decompose this into an AtMostN VALUE and an AtLeastN VALUE con- 
straint. Consider the ATLEASTNVALUE constraint. The 5 variables can take at most 
4 different values because X 2 , X3, X4, and X 5 can only take values 2, 3 and 4. Hence, 
there is no bound support for N = 5. Enforcing BC on the ATLEASTNVALUE con- 
straint therefore prunes N = 5. Consider now the AtMostN VALUE constraint. Since 
X 2 and X4 guarantee that we take at least 2 different values, there is no bound support 
for N = 1. Hence enforcing BC on an AtMostN VALUE constraint prunes N = 1. If 
X\ = 1, 3 or 5, or X$ = 3 then any complete assignment uses at least 3 different val- 
ues. Hence there is also no bound support for these assignments. Pruning these values 



gives bound consistent domains for the original N VALUE constraint: 
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To show that decomposing the NValue constraint into these two parts does not 
hinder propagation in general, we will use the following lemma. Given an assignment 
S of values, card(S) denotes the number of distinct values in S. Given a vector of 
variables X = X\ . . .X n , card^(X) = max{card(S) | S E IJxiexrangelXi)} and 
cardi(X) = min{card(S) | S E IIx i exrange(Xi)}. 

Lemma 1 (adapted from [5]) Consider NValue([Xl, . . . , X n ],N). If D(N) C 
[cardi(X), card^{X)\, then N is BC. 

Proof: Let S raj „ be an assignment of X in lJx i exrange(Xi) with card{S m i n ) = 
cardi (X) and S max be an assignment of X in IIx i ex'''ange(X i ) with card(S max ) = 
card-\(X). Consider the sequence S m i n = So, S\, . . . , S n = S max where Sk+i is the 
same as Sk except that Xk+i has been assigned its value in Sraax instead of its value in 
Smin- \card(Sk+i) — card(Sk)\ < 1 because they only differ on Xk+i- Hence, for any 
p E [cardi(X), card^ (X)}, there exists fc E l..n with card(Sk) = P- Thus, (Sk,p) is a 
bound support for p on NValue([Xl, . . . , X n ],N). Therefore, min(N) and max(N) 
have a bound support. □ 

We now prove that decomposing the NValue constraint into AtMostNValue 
and AtLeastNValue constraints does not hinder pruning when enforcing BC. 

Theorem 2 BC on NValue([Xi, . . . , X n ], N) is equivalent to BC on 
AtMostNValue([Xi, . . . , X n ] , N) and on AtLeastNValue([A"i, . . . , X n ], N). 

Proof: Suppose the AtMostNValue and AtLeastNValue constraints are BC. 
The AtMostNValue constraint guarantees that cardi(X) < min(N) and the 
AtLeastNValue constraint guarantees that card^{X) > max(N). Therefore, 
D(N) E [card[ (X), card^ (X)]. By Lemma 1, the variable N is bound consistent. 

Consider a variable/bound value pair X t = b. Let (Sf east ,pi) be a bound sup- 
port of Xi = b in the AtLeastNValue constraint and (S most ,p2) be a bound sup- 
port of Xi = b in the AtMostNValue constraint. We have card(S\ east ) > pi 
and card(S most ) < p 2 by definition of AtLeastNValue and AtMostNValue. 
Consider the sequence Sf east — Sq, S±, . . . , S 1 ^ = S most where Sl +1 is the same 
as Si except that Xk+i has been assigned its value in S most instead of its value in 
Sf east . \card(Sl +1 ) — card(Sl)\ < 1 because they only differ on Xk+i- Hence, 
there exists k E l..n with min(pi,p2) < card(S\) < max(pi,p2). We know 
that pi and p 2 belong to range(N) because they belong to bound supports. Thus, 
card(Sl) E range(N) and (S^, card{S\)) is a bound support for Xi = b on 
NVALUE([Xi,...,X„],AO. □ 

When enforcing domain consistency, Bessiere et al. [5] noted that decomposing the 
NValue constraint into AtMostNValue and AtLeastNValue constraints does 



hinder propagation, but only when dom(N) contains just cardi (X) and card^ (X) and 
there is a gap in the domain in-between (see Theorem 1 in [5] and the discussion that 
follows). When enforcing BC, any such gap in the domain for N is ignored. 



4 AtMostNValue constraint 



We now give a decomposition for the AtMostN Value constraint which does not hin- 
der bound consistency propagation. To decompose the AtMostNValue constraint, 
we introduce 0/1 variables, A au to represent whether X^ uses a value in the interval 
[I, it], and "pyramid" variables, Mi u with domains [0, min [u — I + 1, n)] which count 
the number of values taken inside the interval [I, u]. To constrain these introduced vari- 
ables, we post the following constraints: 

Ann = 1 Xi G [I, u] \Jl<i<n,\<l<u<d (4) 

A iiu < Miu Vl<i<n,l<l<u<d (5) 

M lu = M lk + M (k+1)u V 1 < k < u < d (6) 

M ld < N (7) 



Running Example. Consider the decomposition of an AtMostNValue constraint 
over the following variables and values: 





1 2 3 4 5 
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Observe that we consider that value 5 for N has already been pruned by 
ATLEASTNVALUE, as will be shown in next sections. Bound consistency reasoning 
on the decomposition will make the following inferences. As X 2 = % from (4) we 
get A 2 22 = 1- Hence by (5), M 22 = 1. Similarly, as X 4 = 4, we get A 444 = 1 
and M 44 = 1. Now N e {1,2}. By (7) and (6), M 15 < N, M 15 = M 14 + M 55 , 
M M = M13 + M 44 , M13 = M12 + M 33 , M12 = Mn + M 22 . Since M 22 = M 44 = 1, 
we deduce that N > 1 an<f hence N — 2. 77z/s §/ves Mn = M33 = M55 = 0. By (5), 
Am = A133 = A155 = A533 = 0. Finally, from (4), we get X\ = 2 and X^ = 3. 77zw 
gives ms bound consistent domains for the AtMOSTN VALUE constraint. 

We now prove that this decomposition does not hinder propagation in general. 

Theorem 3 BC on constraints (4) to (7) is equivalent to BC on AtMostNValue 
([Xi, . . . , X n ],N), and takes 0(nd 3 ) time to enforce down the branch of the search 
tree. 

Proof: First note that changing the domains of the X variables cannot affect the upper 
bound of N by the AtMostNValue constraint and, conversely, changing the lower 
bound of N cannot affect the domains of the X variables. 



Let Y = {X P1 , . . . , X Pk } be a maximum cardinality subset of variables of X whose 
ranges are pairwise disjoint (i.e., range(X Pi ) n range(X Pj ) — 0, Mi, j 6 l..k, i ^ j). 
Let Iy = {[h, Ci] | bi — min(X Pi ), Ci — max{X Pi ) 1 X Pi e Y} be the corresponding 
ordered set of disjoint ranges of the variables in Y. It has been shown in [4] that \ Y\ = 

cardi(X). 

Consider the interval [bi, Ci] 6 Iy. Constraints (5) ensure that the variables M biCi 
i = [1, . . . , k] are greater than or equal to 1 and constraints (6) ensure that the vari- 
able Mid is greater than or equal to the sum of lower bounds of variables M biCi , 
i = [1, . . . , k], because intervals [bi, Ci] are disjoint. Therefore, the variable N is greater 
than or equal to cardi (X) and it is bound consistent. 

We show that when N is BC and dom(N) ^ {cardi(X)}, all X variables are BC. 
Take any assignment S E lJx i exrange(X i ) such that card(S) — cardi(X). Let 
S[Xi <— b] be the assignment S where the value of Xj in S has been replaced by b, one 
of the bounds of X { . We know that card(S[Xi <— b\) € [card(S) - 1, card(S) + 1] = 
[cardi(X) — 1, cardi(X) + 1] because only one variable has been flipped. Hence, any 
assignment (S,p) with p > cardi(X) + 1 is a bound support. dom(N) necessarily 
contains such a value p by assumption. 

The only case when pruning might occur is if the variable N is ground and 
cardi (X) = N. Constraints (6) imply that Mid equals the sum of variables Mi^-i + 
M bl , Cl +M Cl+ i tb2 -! . . . + M bN , CN +M CN+lyd . The lower bound of the variable M CiM 
is greater than one and there are \Y \ — cardi (X) = N of these intervals. Therefore, by 
constraint (7), the upper bound of variables M Ci _ 1+ i i b i _i that correspond to intervals 
outside the set Iy are forced to zero. 

There are 0(nd 2 ) constraints (4) and constraints (5) that can be woken 0(d) times 
down the branch of the search tree. Each requires O(l) time for a total of 0(nd 3 ) down 
the branch. There are 0(d 2 ) constraints (6) which can be woken 0(n) times down the 
branch and each invocation takes O(l) time. This gives a total of 0(nd 2 ). The final 
complexity down the branch of the search tree is therefore 0(nd 3 ). □ 

5 Faster decompositions 

We can improve how our solver handles this decomposition of the AtMostNValue 
constraint by adding implied constraints and by implementing specialized propagators. 
Our first improvement is to add an implied constraint and enforce BC on it: 



This does not change the asymptotic complexity of reasoning with the decomposition, 
nor does it improve the level of propagation achieved. However, we have found that the 
fixed point of propagation is reached quicker in practice with such an implied constraint. 

Our second improvement decreases the asymptotic complexity of enforcing BC on 
the decomposition of Section 4. The complexity is dominated by reasoning with con- 
straints (4) which channel from Xi to Au u and thence onto M\ u (through constraints 
(5)). If constraints (4) were not woken uselessly, enforcing BC should cost 0(1) per 



d 




(8) 



constraint down the branch. Unfortunately, existing solvers wake up such constraints 
as soon as a bound is modified, thus a cost in 0(d). We therefore implemented a spe- 
cialized propagator to channel between Xi and M;„ efficiently. To be more precise, we 
remove the 0(nd 2 ) variables Au u and replace them with 0(nd) Boolean variables Zy. 
We then add the following constraints 



Zn = 1 X { < j l<j<d (9) 

Vi)=lV2» = 0VM iu >0 l<l<u<d,l<i<n (10) 

These constraints are enough to channel changes in the bounds of the X variables 
to M\ u . There are 0(nd) constraints (9), each of which can be propagated in time 0(d) 
over a branch, for a total of 0(nd 2 ). There are 0{nd 2 ) clausal constraints (10) and each 
of them can be made BC in time 0(1) down a branch of the search tree, for a total cost 
of 0(nd 2 ). Since channeling dominates the asymptotic complexity of the entire decom- 
position of Section 4, this improves the complexity of this decomposition to 0(nd 2 ). 
This is similar to the technique used in [6] to improve the asymptotic complexity of the 
decomposition of the All-Different constraint. 

Our third improvement is to enforce stronger pruning by observing that when 
Miu = 0, we can remove the interval [I, u] from all variables, regardless of whether 
this modifies their bounds. This corresponds to enforcing RC on constraints (4). Inter- 
estingly, this is sufficient to achieve RC on the AtMostNValue constraint. Unfortu- 
nately, constraints (10) cannot achieve this pruning and using constraints (4) increases 
the complexity of the decomposition back to 0(nd 3 ). We do it by extending the de- 
composition with O(dlogd) Boolean variables B^y^k^ S [0, 1], 1 < i < n, 1 < I < 
d, < k < [log d\ . The following constraint ensures that B^j = 1 Xi = j. 



DOMAINBlTMAPp^, [Bai, • • ■ ,B idd ]) (11) 

Clearly we can enforce RC on this constraint in time 0(d) over a branch, and 0(nd) 
for all variables Xi. We can then use the following clausal constraints to channel from 
variables M;„ to these variables and on to the X variables. These constraints are posted 
for every 1 < i < n, 1 < I < u < d, 1 < j < d and integers k such that < k < 
LlogdJ: 

Bij(j+2 k + 1 ~i) = 1 V B i ^ J+2 k_ 1) = (12) 
-Sii(i+2 fc +!-i) = 1 V Si( :)+ 2 fc )0'+2 fc + 1 -i) = (13) 
M Iu9 4 0VB a(I+2 *_ 1) =0 2 k <u-l + K2 k+1 (14) 

M lu ^0V B l(u _ 2k+l)u = {) 2 k <u-l + K2 k+1 (15) 

The variable Bu( l+2 k _ 1 y similarly to the variables Ai u , is true when Xi s [1,1 + 
2 k — 1], but instead of having one such variable for every interval, we only have them for 
intervals whose length is a power of two. When M lu = 0, with 2 k < u - I + 1 < 2 k+1 , 
the constraints (14)— (15) set to the B variables that correspond to the two intervals of 
length 2 k that start at I and finish at u, respectively. In turn, the constraints (12)— (13) 



set to the B variables that correspond to intervals of length 2 fe ~ 1 , all the way down to 
intervals of size 1. These trigger the constraints (11), so all values in the interval [I, u] 
are removed from the domains of all variables. 

Example. Suppose Xi G [5,9]. Then, by (9), Zu = 0, Z w = 1 and by (10), M 59 > 0. 
Conversely, suppose M59 = and Xi G [1, 10]. Then, by (14)-(15), we get -B158 = 
and £?i69 = 0. From Bi 58 = and (12)-(13) we get Bi 56 = 0, Bus = 0, B155 = 
Bw6 = Bin — B\ss — 0, and by (11), the interval [5, 8] is pruned from X\. Similarly, 
Biqq = causes the interval [6, 9] to be removed from Xi, so Xi G [1, 4] U {10}. 

Note that RC can be enforced on each of these constraints in constant time over a 
branch. There exist 0(nd\ogd) of the constraints (12)— (13) and 0(nd 2 ) of the con- 
straints (14)— (15), so the total time to propagate them all down a branch is 0(ncP). 

6 AtLeastNValue constraint 

There is a similar decomposition for the AtLeastNValue constraint. We introduce 
0/1 variables, Au u to represent whether Xi uses a value in the interval [l,u], and integer 
variables, Ei u with domains [0, n] to count the number of times values in [/, u] are re- 
used, that is, how much the number of variables taking values in [l,u] exceeds the 
number u — I + 1 of values in [I, u].To constrain these introduced variables, we post the 
following constraints: 

Auu = 1 ^> Xi£[l,u] Vl<i<n,l<l<u<d (16) 

Eiu >YTi=iA ilu -{u-l + l) Vl<l<u<d (17) 

E lu = E lk + E (k+1)u Vl<k<u<d (18) 

N<n-E ld (19) 

Running Example. Consider the decomposition of an AtLeastNValue constraint 
over the following variables and values: 





1 2 3 4 5 




* * * * 


x 2 




x 3 


* * * 


Xi 




x 5 




N 





Bound consistency reasoning on the decomposition will make the following inferences. 
As dom(Xi) C [2, 4] for i G 2. .5, from (16) we get = 1 for i G 2. .5. Hence, by 
(17), E 24 > 1. By (18), E 15 = E 14 + E 55 , E 14 = E n +E 2 4. Since E 2i >lwe deduce 
that £15 > 1. Finally, from (19) and the fact that n — 5, we get N < 4. This gives us 
bound consistent domains for the AtLeastN VALUE constraint. 

We now prove that this decomposition does not hinder propagation in general. 

Theorem 4 BC on the constraints (16) to (19) is equivalent to BC on 
AtLeastNValue ([Xi, . . . ,X n ],N), and takes 0(nd 3 ) time to enforce down the 
branch of the search tree. 



Proof: First note that changing the domains of the X variables cannot affect the lower 
bound of N by the AtLeastNValue constraint and, conversely, changing the upper 
bound of N cannot affect the domains of the X variables. 

It is known [3] that card^ (X) is equal to the size of a maximum matching M in the 
value graph of the constraint. Since N < n — E\d, we show that the lower bound of 
Eid is equal ton - |M|. 4 We first show that we can construct a matching M(E) of size 
n — min(Eid), then show that it is a maximum matching. The proof uses a partition of 
the interval [1, d] into a set of maximal saturated intervals / = {[bj, Cj}}, j = 1, . . . , k 
such that min(E bj )C ) = Y^i=i min(A ibjCj ) — {cj — bj + 1) and a set of unsaturated 
intervals {[bj, Cj]}such that mtn(£j jiCj ) = 0. 

Let / = {[6j,Cj] | j <E [1 . . -k}} be the ordered set of maximal intervals such 
that min(E b . Cj ) = Y^ii=i m i n (Ai bjCj ) — i c j — bj + 1). Note that the intervals in 
/ are disjoint otherwise intervals are not maximal. An interval [&,, a] is smaller than 
[&j,Cj] iff q < bj. We denote the union of the first j intervals D\ — U^ =1 [6^ , c^], 
j = [1, . . . , k], p = \Dj \ and the variables whose domain is inside one of intervals I 
X T = {X Pi \D(X Pi ) CD*}. 

Our construction of a matching uses two sets of variables, Xj and X \ Xj. First, we 
identify the cardinality of these two sets. Namely, we show that the size of the set Xj is 
p + min(Ei t d) and the size of the set X \ Xi is n — (p + min(Ei^)). 

Intervals I are saturated therefore each value from these intervals are taken by a 
variable in Xj. Therefore, Xj has size at least p. Moreover, there exist min(Eid) 
additional variables that take values from Dj, because values from intervals be- 
tween two consecutive intervals in I do not contribute to the lower bound of the 
variable E by construction of /. Therefore, the number of variables in is at 
least p + min(Ei^d). Note that constraints (18) imply that E\& equals the sum of 
variables £a ;6l _i + E bltCl + E Cl+Xtb2 -i ... + E bkyCk + E Ck+ltd . As intervals in I 
are disjoint then J2i=i ^"(^^c.) = \Xi\ — p. If \Xj\ > p + min(Ei^d) then 
J2i=i m ^ n {^bi,ci) > min(Ei^) and the lower bound of the variable E\& will be in- 
creased. Hence, [Xj[ = p + min(Ei^). 

Since all these intervals are saturated, we can construct a matching Mj of size p 
using the variables in Xi. The size of X \ Xj is n — p — min(Eid). We show by 
contradiction that we can construct a matching M D _ D k of size n — p — min(Eid) 
using the variables in X \ Xj and the values D — Dj. 

Suppose such a matching does not exist. Then, there exists an interval [b, c] such 
that \ Dj) fl [b,c]\ < ^2 ieX \x! min(A ibc ), i.e., after consuming the values in 
I with variables in Xi, we are left with fewer values in [b, c] than variables whose 
domain is contained in [b, c] . We denote p 1 = \[b,c] Pi Dj\, so that p' is the number of 
values inside the interval [b, c] that are taken by variables in Xj. The total number of 
variables inside the interval [b, c] is greater than or equal to Y^i=i fnin(Ai bc ). The total 
number of variables Xj inside the interval [b, c] equals to p' + min(E b , c ). Therefore, 
J2iex\X! min(A ibc ) < Yh=i min(A ibc ) - p' - min(E btC ). On the other hand, the 
number of values that are not taken by the variables Xj in the interval [b, c] is c — 
b + 1 — p'. Therefore, we obtain the inequality c — 6 + 1 — p' < J27=i m in(A ibc ) — 



4 We assume that E\d is not pruned by other constraints. 



p' — min(Eb vC ) or min(Et c ) < Ym=i m i n {Aibc) — (c — b + 1). By construction of 
I, Ym=\ min(Au, c ) — (c — b + 1) < min(Eb c ), otherwise the intervals in / that are 
subsets of [b, c] are not maximal. This leads to a contradiction, so we can construct a 
matching M{E) of size n — min(E\d). 

Now suppose that M(E) is not a maximum matching. This means that min(Eid) 
is overestimated by propagation on (16) and (19). Since M(E) is not a maximum 
matching, there exists an augmenting path of M(E), that produces M', such that 
\M'\ = \M(E) \ + 1. This new matching covers all the values that M(E) covers and 
one additional value q. We show that q cannot belong to the interval [1, d]. 

The value q cannot be in any interval in /, because all values in [bi, Cj] G / are used 
by variables whose domain is contained in [&,, a]. In addition, q cannot be in an interval 
[b, c] between two consecutive intervals in /, because those intervals do not contribute 
to the lower bound of E\d- Thus, M' cannot cover more values than M(E) and they 
must have the same size, a contradiction. 

We show that when N is BC and dom(N) ^ {card-[ (X)}, all X variables are BC. 
Take any assignment S G IIx i exf'ange(Xi) such that card(S) — card^(X). Let 
S[Xi <— b] be the assignment S where the value of Xi in S has been replaced by b, one 
of the bounds of X^ We know that card(S[Xi <— b]) G [card(S) - 1, card(S) + 1] = 
[card^(X) — 1, card-\(X) + 1] because only one variable has been flipped. Hence, any 
assignment (S,p) with p < card^(X) — 1 is a bound support. dom(N) necessarily 
contains such a value p by assumption. 

We now show that if N = card^(X), enforcing BC on the constraints (16)— (19) 
makes the variables X BC with respect to the AtLeastNValue constraint. We first 
observe that in a bound support, variables X must take the maximum number of dif- 
ferent values because N — card^(X). Hence, in a bound support, variables X that are 
not included in a saturated interval will take values outside any saturated interval they 
overlap and they all take different values. We recall that min{Exd) = n — \M\ = 
n — card^(X). Hence, by constraint (19), Eid = n — N. We recall the the size 
of set Xj equals p + E\d- Constraints (18) imply that E\d equals the sum of vari- 
ables £a ;bl _i +.E&i,ci +.Eci +1,62-1 • • - + Eb k ,c k +E Ck+ i^ d and^ i= i min(E bitCi ) = 
\Xi\ — p = min(Eid) = max(Eid)- Hence, by constraints (18), the upper bounds of 
all variables E^^a that correspond to the saturated intervals are forced to mm(i?; )i . Ci ). 
Thus, by constraints (16) and (17), all variables in X \ Xi have their bounds pruned 
if they belong to Dj. By constraints (18) again, the upper bounds of all variables Ei u 
that correspond to the unsaturated intervals are forced to take value 0, and all variables 
E\i u i with [I', u'] G [l 7 u] are forced to as well. Thus, by constraints (16) and (17), all 
variables in X \ Xj have their bounds pruned if they belong to a Hall interval of other 
variables in X \ Xi. This is what BC on the All-Different constraint does [6]. 

There are 0{nd 2 ) constraints (16) that can be woken 0(d) times down the branch 
of the search tree in O(l), so a total of 0(nd 3 ) down the branch. There are 0(d 2 ) 
constraints (17) which can be propagated in time 0(n) down the branch for a 0{nd 2 ). 
There are O (d 2 ) constraints (18) which can be woken O (n) times each down the branch 
for a total cost in 0(n) time down the branch. Thus a total of 0(nd 2 ). The final com- 
plexity down the branch of the search tree is therefore 0(nd 3 ). □ 



The complexity of enforcing BC on the AtLeastNValue constraint can be im- 
proved to 0(nd 2 ) in way similar to that described in Section 5 and in [6]. 

7 Experimental results 

To evaluate these decompositions, we performed experiments on two problem domains. 
We used the same problems as in a previous experimental comparison of propagators 
for the AtMostN Value constraint [5]. We ran experiments with Hog Solver 6.2 on 
an Intel Xeon 4 CPU, 2.0 Ghz, 4Gb RAM. 

7.1 Dominating set of the Queen's graph 

The problem is to put the minimum number of queens on a n x n chessboard, so that 
each square either contains a queen or is attacked by one. This is equivalent to the 
dominating set problem of the Queen's graph. Each vertex in the Queen's graph cor- 
responds to a square of the chessboard and there exists an edge between two vertices 
iff a queen from one square can attack a queen from the other square. To model the 
problem, we use a variable Xi for each square, and values from 1 to n 2 and post a 
single AtMostNValueQ-Xi, . . . , X n *],N) constraint. The value j belongs to D(Xi) 
iff there exists an edge (i, j) in the Queen's graph or j = i. We use minimum domain 
variable ordering and a lexicographical value ordering. For n < 120, all minimum 
dominating sets for the Queen's problem are either of size \n/2] or \n/2 + 1] [10]. 
We therefore only solved instances for these two values of N. 

We compare our decomposition with two simple decompositions of the 
AtMostNValue constraint. The first decomposition is the one described in Section 
3.1 except that in constraint (3), we replace "=" by "<". We denote this decomposition 
Occs. The second decomposition is similar to the first one, but we use the cardinality 
variables of a GCC constraint to keep track of the used values. We call this decom- 
position OccSgcc- The final two decompositions are variants of the decomposition de- 
scribed in Section 4, which we call Pyramidsc or Pyramidac depending whether 
we enforce BC or RC on our decomposition. As explained in Section 5, we channel 
the variables Xi directly to the pyramid variables M\ u to avoid introducing many aux- 
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iliary variables Au u and we add the redundant constraint Y^i=i Ma — M l n 2 to the 
decomposition to speed up the propagation across the pyramid. For the decomposition 
that enforces RC, we did not fully implement the 0(nd 2 ) decomposition of Section 5, 
but rather a simple channeling propagator that achieves RC in 0(nd 3 ) on (4), but with 
better asymptotic constants than constraints (4). Finally, we re-implemented the ternary 
sum constraint Z = X + Y in Hog. This gave us about 30% speed up. 

Results are presented in Table 1 . Our decomposition performs better than the other 
two decompositions, both in runtime and in number of backtracks. We observe that BC 
and RC prune the same (i.e., same number of backtracks) on our decomposition but BC 
is faster on larger problems. It should be pointed out that our results are comparable 
with the results for the AtMostNValue bounds consistency propagator from [5]. 
Whilst our decomposition is not as efficient as the best results presented in that paper, 
our decomposition was easier to implement. 



Table 1. Backtracks and rumtime (in seconds) to solve the dominating set problem for the 
Queen's graph. Best results for any statistic are bold fonted. 



n 


N 


Occs 


OcCSgcc 


Pyramid bc 


Pyramiduc 




backtracks time 


backtracks time 


backtracks time 


backtracks time 


5 


3 


34 0.01 


34 0.06 


7 0.00 


7 0.00 


6 


3 


540 0.16 


540 2.56 


118 0.03 


118 0.03 


7 


4 


195,212 84.50 


195,212 1681,21 


83,731 15.49 


83,731 21.21 


8 


5 


390,717 255.64 


390,717 8,568.35 


256,582 58.42 


256,582 89.30 



7.2 Random binary CSP problems 

We also reproduced the set of experiments on random binary CSP problems from [5]. 
These problems can be described by four parameters. The number of variables n, the 
domain size d, the number of binary constraints m and the number of forbidden tuples 
in each binary constraint. The first three classes are hard problems at the phase transition 
in satisfiability. The last two classes are under-constrained problems. We add a single 
AtMostNValue constraint over all variables to bound the number of values N that 
can be used in a solution. 

As in [5], we generated 500 instances for each of the following 5 classes: 



- class A 


n 


= 100, d-- 


= 10, m = 


= 250, t = 52, N = 8 


- class B 


n 


= 50, d = 


15, m = 


120, t = 116, N = 6 


- class C 


n 


= 40, d = 


20, m = 


80, t= 240, N = 6 


- class D 


n 


= 200, d-- 


= 15, m = 


= 600, t = 85,N = 8 


- class E : 


n 


= 60, d = 


30, m = 


150,^ = 350,^ = 6 



All instances are solved using the minimum domain variable ordering heuristic, a 
lexicographical value ordering and a timeout of 600 seconds. We use the same decom- 
positions of the AtMostNValue constraint as in the experiments with the dominating 
set of the Queen's graph. Results are given in Table 2. On classes A, B, C (hard prob- 
lems), our decomposition is faster than the other two decompositions and solves more 
instances whatever we use BC or RC. On classes D, E (under-constrained problems), 
enforcing BC on our decomposition does not prune the search space enough. This leads 
to a high number of backtracks and a significant slow down. Our decomposition with 
RC is again better than the other decompositions. 

These experiments demonstrate that this new decomposition is efficient to use in 
practice. Of course, if the toolkit contains a specialized BC propagator for the N Value 
constraint, we will probably do best to use this. However, when the toolkit lacks such a 
propagator (as is often the case), it is reasonable to try out our decomposition. 

8 Other related work 

Decompositions have been given for a number of other global constraints. For example, 
Beldiceanu et al. identify conditions under which global constraints specified as au- 
tomata can be decomposed into signature and transition constraints without hindering 



Table 2. Randomly generated binary CSPs with an AtMostNValue constraint. For each class 
we give two lines of results. Line 1: number of instances solved in 600 sec (#solved), average 
backtracks on solved instances (#bt), average time on solved instances (time). Line 2: number of 
instances solved by all methods, average backtracks and time on these instances. Best results for 
any statistic are bold fonted. 





Occs 


OCCS gcc 


Pyraraid.BC 


Pyramidnc 


Class 


#solved #bt time 


#solved #bt time 


#solved #bt time 


#solved #bt time 


A total solved 
solved by all 


453 139,120 111.2 
79 8,960 7.1 


79 8,960 302.8 
79 8,960 302.8 


463 168,929 101.8 
79 9,104 5.7 


462 148,673 105.7 
79 8,739 6.3 


B total solved 
solved by all 


473 228,757 113.5 
125 7,377 17.6 


125 37,377 292.9 
125 37,377 292.9 


492 224,862 89.0 
125 32,810 10.9 


491 235,715 94.9 
125 32,110 12.2 


C total solved 
solved by all 


479 233,341 110.3 
156 37242 16.4 


156 37,242 290.3 
156 37,242 290.3 


492 234,915 79.5 
156 32,184 9.7 


490 224,802 84.2 
156 31,715 11.1 


D total solved 
solved by all 


482 8,306 6.0 
391 195 0.2 


456 207 14.9 
391 195 13.1 


416 168,021 24.2 
391 145,534 14.9 


489 13,776 9.0 

391 690 0.4 


E total solved 
solved by all 


500 331 0.3 

500 331 0.3 


500 331 5.1 
500 331 5.1 


500 4,252 0.4 
500 4,252 0.4 


500 174 0.1 
500 174 0.1 


TOTALS 
Total solved/tried 
Avg time for solved 
Avg bts for solved 


2,387/2,500 
67.0 
120,303 


1,316/2,500 
87.5 
8,700 


2,363 /2,500 
59.364 
163,473 


2,432/2,500 
58.0 
123,931 



propagation [11]. As a second example, many global constraints can be decomposed 
using Roots and Range which can themselves often be propagated effectively using 
simple decompositions [12-14]. As a third example, the REGULAR and Cfg constraints 
can be decomposed without hindering propagation [15-17]. As a fourth example, de- 
compositions of the Sequence constraint have been shown to be effective [18]. Most 
recently, we demonstrated that the All-Different and GCC constraint can be de- 
composed into simple primitive constraints without hindering bound consistency prop- 
agation [6]. These decompositions also introduced variables to count variables using 
values in an interval. For example, the decomposition of All-Different ensures that 
no interval has more variables taking values in the interval than the number of values 
in the interval. Using a circuit complexity lower bound, we also proved that there is 
no polynomial sized SAT decomposition of the All-Different constraint (and there- 
fore of its generalizations like N Value) on which unit propagation achieves domain 
consistency [19]. 

9 Conclusions 

We have shown that the N Value constraint can be decomposed into simple arithmetic 
constraints. This decomposition permits a global view to be maintained that achieves 
bound consistency. Our experiments demonstrate that this decomposition is a relatively 
efficient and effective means to propagate the global NValue constraint. Decomposi- 



tions of global constraints like this are interesting for a number of other reasons. First 
they provide fresh insight into the workings of specialized propagation algorithms. In 
this case, it is surprising that interval graph reasoning used to enforce bound consis- 
tency on the NValue can be simulated with simple arithmetic constraints. Second, 
such decompositions may make nogood learning easier to implement. We can identify 
compact nogoods with different parts of the decomposition. Third, the variables intro- 
duced in such decompositions give ths solver access to the state of the propagator. This 
may be useful when making branching decisions. Fourth, such decompositions can of- 
ten be encoded effectively into SAT and linear inequalities. We can thereby provide 
the power of global propagation algorithms to SAT and IP solvers. Finally, we expect 
many other global constraints that count variables and values to be decomposable in 
similar ways. For instance, it is known that BC on the SoftAllDifferent constraint 
is equivalent to BC on the AtLeastN Value constraint [5]. We are currently studying 
decompositions of other global constraints, such as the Used-By [20] and other soft 
global constraints such as SOFTGCC [21]. 
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